Interlinking Developer Identities Within and Across Open Source Projects: The Linked Data Approach
نویسندگان
چکیده
Software developers use various software repositories in order to interact with each other or to solve related problems. These repositories provide a rich source of information for a wide range of tasks. However, one issue to overcome in order to make this information useful is the identification and interlinking of multiple identities of developers. In this paper, we propose a Linked Data-based methodology to interlink and integrate multiple identities of a developer found in different software repositories of a project as well as across repositories of multiple projects. By providing such interlinking will enable us to keep track of a developer’s activity not only within a single project but also across multiple projects. The methodology will be presented in general and applied to 5 Apache projects as a case study. Further, we show that the few methods suggested so far are not always appropriate to overcome the developer identification problem. Keywords-Linked Data; Software Engineering; FLOSS Repositories; Data Integration; Developer Identities
منابع مشابه
Examining Turnover in Open Source Software Projects Using Logistic Hierarchical Linear Modeling Approach
Developer turnover in open source software projects is a critical and insufficiently researched problem. Previous research has focused on understanding the developer motivations to contribute using either the individual developer perspective or the project perspective. In this exploratory study we argue that because the developers are embedded in projects it is imperative to include both perspe...
متن کاملInterlinking Cross-Lingual RDF Data Sets
Linked Open Data is an essential part of the Semantic Web. More and more data sets are published in natural languages comprising not only English but other languages as well. It becomes necessary to link the same entities distributed across different RDF data sets. This paper is an initial outline of the research to be conducted on cross-lingual RDF data set interlinking, and it presents severa...
متن کاملThe KnowledgeStore: A Storage Framework for Interlinking Unstructured and Structured Knowledge
Although the quantity of structured information on the Web and within organizations is increasing, the majority of information remains available only in unstructured form. While different in form, both unstructured and structured information sources provide information about entities in the world and their properties and relations; still, frameworks for their seamless integration have not been ...
متن کاملLinking a Community Platform to the Linked Open Data Cloud
Linked Data promises access to a vast amount of resources for learners and teachers. Various research projects have focused on providing educational resources as Linked Data. In many of these projects the focus has been on interoperability of metadata and on linking them into the linked data cloud. In this paper we focus on the community aspect. We start from the observation that sharing data i...
متن کاملEmergence of New Project Teams from Open Source Software Developer Networks: Impact of Prior Collaboration Ties
Software development has traditionally been regarded as an activity that can only be effectively conducted and managed within a firm setting. However, contrary to such assertions, the open source software development (OSSD) approach, in which software developers in Internet-based communities coordinate to voluntarily contribute programming code, has recently emerged as a promising alternative t...
متن کامل